Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 2151 |
| Missing cells | 4303 |
| Missing cells (%) | 11.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 288.7 KiB |
| Average record size in memory | 137.4 B |
Variable types
| NUM | 9 |
|---|---|
| CAT | 6 |
| UNSUPPORTED | 2 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-05-21 15:05:52.294922 |
|---|---|
| Analysis finished | 2020-05-21 15:06:05.594872 |
| Duration | 13.3 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Warnings
| Warning | Type |
|---|---|
| number_of_reviews has constant value "0" | Constant |
| name has a high cardinality: 2003 distinct values | High cardinality |
| host_name has a high cardinality: 579 distinct values | High cardinality |
| id is highly correlated with df_index | High correlation |
| df_index is highly correlated with id | High correlation |
| neighbourhood is highly correlated with neighbourhood_group | High correlation |
| neighbourhood_group is highly correlated with neighbourhood | High correlation |
| last_review has 2151 (100.0%) missing values | Missing |
| reviews_per_month has 2151 (100.0%) missing values | Missing |
| name is uniformly distributed | Uniform |
| df_index has unique values | Unique |
| id has unique values | Unique |
| last_review is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| reviews_per_month is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Distinct count | 2151 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5406.978614597861 |
|---|---|
| Minimum | 18 |
| Maximum | 7906 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 1314.5 |
| Q1 | 4143 |
| median | 5866 |
| Q3 | 7105.5 |
| 95-th percentile | 7797.5 |
| Maximum | 7906 |
| Range | 7888 |
| Interquartile range (IQR) | 2962.5 |
Descriptive statistics
| Standard deviation | 2015.913291 |
|---|---|
| Coefficient of variation (CV) | 0.3728354473 |
| Kurtosis | -0.2194555785 |
| Mean | 5406.978615 |
| Median Absolute Deviation (MAD) | 1423 |
| Skewness | -0.7847043048 |
| Sum | 11630411 |
| Variance | 4063906.395 |
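As a quick sanity check, several of the derived figures reported just above for this variable can be recomputed from the primary ones. The sketch below is not part of the report; it only copies the reported sum, observation count, standard deviation, quartiles, and extremes, and the variable names are illustrative.

```python
# Figures copied from the report tables above.
n_observations = 2151
total = 11630411          # "Sum"
std_dev = 2015.913291     # "Standard deviation"
q1, q3 = 4143, 7105.5     # "Q1" and "Q3"
minimum, maximum = 18, 7906

mean = total / n_observations

derived = {
    "mean": mean,                   # Sum / number of observations
    "range": maximum - minimum,     # Maximum - Minimum
    "iqr": q3 - q1,                 # Q3 - Q1
    "cv": std_dev / mean,           # Standard deviation / Mean
    "variance": std_dev ** 2,       # Standard deviation squared
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

Small differences in the last digits are expected, because the inputs are the rounded values shown in the report.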
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 6143 | 1 | < 0.1% | |
| 7563 | 1 | < 0.1% | |
| 5486 | 1 | < 0.1% | |
| 3439 | 1 | < 0.1% | |
| 6848 | 1 | < 0.1% | |
| 7541 | 1 | < 0.1% | |
| 7543 | 1 | < 0.1% | |
| 7545 | 1 | < 0.1% | |
| 5500 | 1 | < 0.1% | |
| 7549 | 1 | < 0.1% | |
| Other values (2141) | 2141 | 99.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 18 | 1 | < 0.1% | |
| 23 | 1 | < 0.1% | |
| 26 | 1 | < 0.1% | |
| 29 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 7906 | 1 | < 0.1% | |
| 7905 | 1 | < 0.1% | |
| 7904 | 1 | < 0.1% | |
| 7903 | 1 | < 0.1% | |
| 7902 | 1 | < 0.1% |
id
Real number (ℝ≥0)
| Distinct count | 2151 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29489562.23059042 |
|---|---|
| Minimum | 355955 |
| Maximum | 38112762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 355955 |
|---|---|
| 5-th percentile | 11505047.5 |
| Q1 | 25793856 |
| median | 32224108 |
| Q3 | 35656696 |
| 95-th percentile | 37899069.5 |
| Maximum | 38112762 |
| Range | 37756807 |
| Interquartile range (IQR) | 9862840 |
Descriptive statistics
| Standard deviation | 8199213.277 |
|---|---|
| Coefficient of variation (CV) | 0.2780378092 |
| Kurtosis | 1.484029242 |
| Mean | 29489562.23 |
| Median Absolute Deviation (MAD) | 4152764 |
| Skewness | -1.36544138 |
| Sum | 6.343204836e+10 |
| Variance | 6.722709836e+13 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 22890495 | 1 | < 0.1% | |
| 37260651 | 1 | < 0.1% | |
| 29705550 | 1 | < 0.1% | |
| 21398866 | 1 | < 0.1% | |
| 4722005 | 1 | < 0.1% | |
| 35775238 | 1 | < 0.1% | |
| 27346264 | 1 | < 0.1% | |
| 29731489 | 1 | < 0.1% | |
| 37877086 | 1 | < 0.1% | |
| 23907679 | 1 | < 0.1% | |
| Other values (2141) | 2141 | 99.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 355955 | 1 | < 0.1% | |
| 481789 | 1 | < 0.1% | |
| 642660 | 1 | < 0.1% | |
| 733863 | 1 | < 0.1% | |
| 768313 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 38112762 | 1 | < 0.1% | |
| 38110493 | 1 | < 0.1% | |
| 38109336 | 1 | < 0.1% | |
| 38108273 | 1 | < 0.1% | |
| 38105126 | 1 | < 0.1% |
name
Categorical
| Distinct count | 2003 |
|---|---|
| Unique (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| Tasteful & Cozy 1 BR near SGH/Tiong Bahru | 8 |
| City-located 1BR loft apartment *BRAND NEW* | 8 |
| MODERN 2 BR APT @ CENTRAL/GEYLANG w/QUEEN BED | 6 |
| Stylish 1BR Located 7 mins from Tg Pagar MRT | 5 |
| Inviting & Cozy 1BR APT 3 mins from Tg Pagar MRT | 5 |
| Other values (1998) | 2119 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Tasteful & Cozy 1 BR near SGH/Tiong Bahru | 8 | 0.4% | |
| City-located 1BR loft apartment *BRAND NEW* | 8 | 0.4% | |
| MODERN 2 BR APT @ CENTRAL/GEYLANG w/QUEEN BED | 6 | 0.3% | |
| Stylish 1BR Located 7 mins from Tg Pagar MRT | 5 | 0.2% | |
| Inviting & Cozy 1BR APT 3 mins from Tg Pagar MRT | 5 | 0.2% | |
| Tasteful & Cozy 1 Bedroom Apt near SGH/Tiong Bahru | 4 | 0.2% | |
| KINEX/GEYLANG SERAI/EUNOS MRT "MODERN LOFT APT" | 4 | 0.2% | |
| Tasteful & Cozy Exe 1-BR Apt near SGH/Tiong Bahru | 4 | 0.2% | |
| City-located studio loft apartment *BRAND NEW* | 4 | 0.2% | |
| Gem of the West 1BR APT 3 mins from Jurong E MRT | 4 | 0.2% | |
| Other values (1993) | 2099 | 97.6% |
Length
| Max length | 92 |
|---|---|
| Median length | 42 |
| Mean length | 38.9986053 |
| Min length | 1 |
host_id
Real number (ℝ≥0)
| Distinct count | 690 |
|---|---|
| Unique (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 122111404.43654114 |
|---|---|
| Minimum | 228867 |
| Maximum | 288567551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 228867 |
|---|---|
| 5-th percentile | 8492007 |
| Q1 | 33267200 |
| median | 111909043 |
| Q3 | 209260798 |
| 95-th percentile | 267763952 |
| Maximum | 288567551 |
| Range | 288338684 |
| Interquartile range (IQR) | 175993598 |
Descriptive statistics
| Standard deviation | 87335732.01 |
|---|---|
| Coefficient of variation (CV) | 0.7152135577 |
| Kurtosis | -1.321337745 |
| Mean | 122111404.4 |
| Median Absolute Deviation (MAD) | 82488190 |
| Skewness | 0.1983563524 |
| Sum | 2.626616309e+11 |
| Variance | 7.627530085e+15 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 66406177 | 109 | 5.1% | |
| 209913841 | 108 | 5.0% | |
| 219550151 | 74 | 3.4% | |
| 8492007 | 68 | 3.2% | |
| 159804766 | 66 | 3.1% | |
| 201034613 | 64 | 3.0% | |
| 156409670 | 61 | 2.8% | |
| 138649185 | 61 | 2.8% | |
| 108773366 | 45 | 2.1% | |
| 151196270 | 42 | 2.0% | |
| Other values (680) | 1453 | 67.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 228867 | 1 | < 0.1% | |
| 229339 | 3 | 0.1% | |
| 519472 | 1 | < 0.1% | |
| 581033 | 1 | < 0.1% | |
| 646629 | 4 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 288567551 | 1 | < 0.1% | |
| 288546201 | 1 | < 0.1% | |
| 288249975 | 1 | < 0.1% | |
| 288110467 | 1 | < 0.1% | |
| 288016519 | 2 | 0.1% |
host_name
Categorical
| Distinct count | 579 |
|---|---|
| Unique (%) | 26.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| Jay | 112 |
| Richards | 108 |
| Alvin | 76 |
| Rain | 74 |
| Xiaoyu | 66 |
| Other values (574) | 1715 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Jay | 112 | 5.2% | |
| Richards | 108 | 5.0% | |
| Alvin | 76 | 3.5% | |
| Rain | 74 | 3.4% | |
| Xiaoyu | 66 | 3.1% | |
| Marcelo | 64 | 3.0% | |
| Rajan | 61 | 2.8% | |
| Paridhi | 61 | 2.8% | |
| RedDoorz | 45 | 2.1% | |
| Jhian | 42 | 2.0% | |
| Other values (569) | 1442 | 67.0% |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 6.062296606 |
| Min length | 1 |
neighbourhood_group
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| Central Region | 1764 |
| West Region | 116 |
| East Region | 108 |
| North-East Region | 88 |
| North Region | 75 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Central Region | 1764 | 82.0% | |
| West Region | 116 | 5.4% | |
| East Region | 108 | 5.0% | |
| North-East Region | 88 | 4.1% | |
| North Region | 75 | 3.5% |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 13.74058577 |
| Min length | 11 |
neighbourhood
Categorical
| Distinct count | 39 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| Geylang | 310 |
| Kallang | 290 |
| Novena | 202 |
| Bukit Merah | 163 |
| Rochor | 131 |
| Other values (34) | 1055 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Geylang | 310 | 14.4% | |
| Kallang | 290 | 13.5% | |
| Novena | 202 | 9.4% | |
| Bukit Merah | 163 | 7.6% | |
| Rochor | 131 | 6.1% | |
| Downtown Core | 124 | 5.8% | |
| Outram | 111 | 5.2% | |
| Queenstown | 84 | 3.9% | |
| River Valley | 80 | 3.7% | |
| Bedok | 77 | 3.6% | |
| Other values (29) | 579 | 26.9% |
Length
| Max length | 23 |
|---|---|
| Median length | 7 |
| Mean length | 8.470943747 |
| Min length | 5 |
latitude
Real number (ℝ≥0)
| Distinct count | 1808 |
|---|---|
| Unique (%) | 84.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3143614644351465 |
|---|---|
| Minimum | 1.24387 |
| Maximum | 1.45328 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 1.24387 |
|---|---|
| 5-th percentile | 1.27579 |
| Q1 | 1.294815 |
| median | 1.31118 |
| Q3 | 1.322365 |
| 95-th percentile | 1.380335 |
| Maximum | 1.45328 |
| Range | 0.20941 |
| Interquartile range (IQR) | 0.02755 |
Descriptive statistics
| Standard deviation | 0.03144324337 |
|---|---|
| Coefficient of variation (CV) | 0.0239228281 |
| Kurtosis | 4.231053884 |
| Mean | 1.314361464 |
| Median Absolute Deviation (MAD) | 0.01358 |
| Skewness | 1.774095844 |
| Sum | 2827.19151 |
| Variance | 0.0009886775538 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.31141 | 6 | 0.3% | |
| 1.31056 | 5 | 0.2% | |
| 1.31538 | 4 | 0.2% | |
| 1.3142 | 4 | 0.2% | |
| 1.31019 | 4 | 0.2% | |
| 1.30122 | 4 | 0.2% | |
| 1.31441 | 4 | 0.2% | |
| 1.31999 | 4 | 0.2% | |
| 1.31521 | 4 | 0.2% | |
| 1.30312 | 4 | 0.2% | |
| Other values (1798) | 2108 | 98.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.24387 | 1 | < 0.1% | |
| 1.24847 | 1 | < 0.1% | |
| 1.24881 | 1 | < 0.1% | |
| 1.24992 | 1 | < 0.1% | |
| 1.26603 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.45328 | 1 | < 0.1% | |
| 1.44893 | 1 | < 0.1% | |
| 1.44787 | 1 | < 0.1% | |
| 1.44663 | 1 | < 0.1% | |
| 1.44622 | 1 | < 0.1% |
longitude
Real number (ℝ≥0)
| Distinct count | 1877 |
|---|---|
| Unique (%) | 87.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.84950771269177 |
|---|---|
| Minimum | 103.68536 |
| Maximum | 103.96751 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 103.68536 |
|---|---|
| 5-th percentile | 103.767365 |
| Q1 | 103.83471 |
| median | 103.85025 |
| Q3 | 103.867015 |
| 95-th percentile | 103.909515 |
| Maximum | 103.96751 |
| Range | 0.28215 |
| Interquartile range (IQR) | 0.032305 |
Descriptive statistics
| Standard deviation | 0.03948268343 |
|---|---|
| Coefficient of variation (CV) | 0.0003801913394 |
| Kurtosis | 2.156138436 |
| Mean | 103.8495077 |
| Median Absolute Deviation (MAD) | 0.01567 |
| Skewness | -0.7331024797 |
| Sum | 223380.2911 |
| Variance | 0.001558882291 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.85337 | 4 | 0.2% | |
| 103.85366 | 4 | 0.2% | |
| 103.84503 | 4 | 0.2% | |
| 103.83226 | 3 | 0.1% | |
| 103.86143 | 3 | 0.1% | |
| 103.83365 | 3 | 0.1% | |
| 103.83315 | 3 | 0.1% | |
| 103.84281 | 3 | 0.1% | |
| 103.87808 | 3 | 0.1% | |
| 103.85452 | 3 | 0.1% | |
| Other values (1867) | 2118 | 98.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.68536 | 1 | < 0.1% | |
| 103.69103 | 1 | < 0.1% | |
| 103.69438 | 1 | < 0.1% | |
| 103.69746 | 1 | < 0.1% | |
| 103.69894 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.96751 | 1 | < 0.1% | |
| 103.96724 | 1 | < 0.1% | |
| 103.96605 | 1 | < 0.1% | |
| 103.96112 | 1 | < 0.1% | |
| 103.96073 | 1 | < 0.1% |
room_type
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| Entire home/apt | 1265 |
| Private room | 799 |
| Shared room | 87 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Entire home/apt | 1265 | 58.8% | |
| Private room | 799 | 37.1% | |
| Shared room | 87 | 4.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.72384937 |
| Min length | 11 |
price
Real number (ℝ≥0)
| Distinct count | 254 |
|---|---|
| Unique (%) | 11.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.8847047884705 |
|---|---|
| Minimum | 15 |
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 85 |
| median | 140 |
| Q3 | 225 |
| 95-th percentile | 400 |
| Maximum | 999 |
| Range | 984 |
| Interquartile range (IQR) | 140 |
Descriptive statistics
| Standard deviation | 131.0531286 |
|---|---|
| Coefficient of variation (CV) | 0.7580377267 |
| Kurtosis | 6.624034983 |
| Mean | 172.8847048 |
| Median Absolute Deviation (MAD) | 65 |
| Skewness | 2.132478563 |
| Sum | 371875 |
| Variance | 17174.92251 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 131 | 63 | 2.9% | |
| 287 | 52 | 2.4% | |
| 200 | 51 | 2.4% | |
| 150 | 49 | 2.3% | |
| 100 | 48 | 2.2% | |
| 69 | 40 | 1.9% | |
| 60 | 39 | 1.8% | |
| 181 | 39 | 1.8% | |
| 137 | 37 | 1.7% | |
| 300 | 37 | 1.7% | |
| Other values (244) | 1696 | 78.8% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 15 | 1 | < 0.1% | |
| 19 | 7 | 0.3% | |
| 21 | 4 | 0.2% | |
| 22 | 7 | 0.3% | |
| 24 | 8 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 999 | 1 | < 0.1% | |
| 950 | 1 | < 0.1% | |
| 900 | 2 | 0.1% | |
| 890 | 1 | < 0.1% | |
| 881 | 1 | < 0.1% |
minimum_nights
Real number (ℝ≥0)
| Distinct count | 64 |
|---|---|
| Unique (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.394235239423523 |
|---|---|
| Minimum | 1 |
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 30 |
| 95-th percentile | 120 |
| Maximum | 1000 |
| Range | 999 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 57.75354087 |
|---|---|
| Coefficient of variation (CV) | 1.839622479 |
| Kurtosis | 46.26420689 |
| Mean | 31.39423524 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 4.76447256 |
| Sum | 67529 |
| Variance | 3335.471483 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 377 | 17.5% | |
| 3 | 359 | 16.7% | |
| 90 | 234 | 10.9% | |
| 30 | 199 | 9.3% | |
| 2 | 176 | 8.2% | |
| 7 | 125 | 5.8% | |
| 5 | 90 | 4.2% | |
| 18 | 72 | 3.3% | |
| 180 | 64 | 3.0% | |
| 20 | 54 | 2.5% | |
| Other values (54) | 401 | 18.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 377 | 17.5% | |
| 2 | 176 | 8.2% | |
| 3 | 359 | 16.7% | |
| 4 | 24 | 1.1% | |
| 5 | 90 | 4.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 1 | < 0.1% | |
| 365 | 18 | 0.8% | |
| 360 | 2 | 0.1% | |
| 240 | 1 | < 0.1% | |
| 210 | 1 | < 0.1% |
number_of_reviews
Boolean
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.8 KiB |
| Value | Count |
|---|---|
| 0 | 2151 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2151 | 100.0% |
calculated_host_listings_count
Real number (ℝ≥0)
| Distinct count | 55 |
|---|---|
| Unique (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.88656438865644 |
|---|---|
| Minimum | 1 |
| Maximum | 274 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 31 |
| Q3 | 79 |
| 95-th percentile | 274 |
| Maximum | 274 |
| Range | 273 |
| Interquartile range (IQR) | 76 |
Descriptive statistics
| Standard deviation | 72.44267123 |
|---|---|
| Coefficient of variation (CV) | 1.209664838 |
| Kurtosis | 1.721529031 |
| Mean | 59.88656439 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 1.520368878 |
| Sum | 128816 |
| Variance | 5247.940615 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 384 | 17.9% | |
| 2 | 118 | 5.5% | |
| 274 | 109 | 5.1% | |
| 157 | 108 | 5.0% | |
| 67 | 82 | 3.8% | |
| 45 | 79 | 3.7% | |
| 113 | 74 | 3.4% | |
| 3 | 74 | 3.4% | |
| 203 | 68 | 3.2% | |
| 79 | 66 | 3.1% | |
| Other values (45) | 989 | 46.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 384 | 17.9% | |
| 2 | 118 | 5.5% | |
| 3 | 74 | 3.4% | |
| 4 | 40 | 1.9% | |
| 5 | 34 | 1.6% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 274 | 109 | 5.1% | |
| 203 | 68 | 3.2% | |
| 157 | 108 | 5.0% | |
| 141 | 24 | 1.1% | |
| 114 | 42 | 2.0% |
availability_365
Real number (ℝ≥0)
| Distinct count | 286 |
|---|---|
| Unique (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 282.465829846583 |
|---|---|
| Minimum | 1 |
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 48 |
| Q1 | 228.5 |
| median | 342 |
| Q3 | 364 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 135.5 |
Descriptive statistics
| Standard deviation | 110.4890255 |
|---|---|
| Coefficient of variation (CV) | 0.3911589079 |
| Kurtosis | -0.06903189666 |
| Mean | 282.4658298 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -1.175105108 |
| Sum | 607584 |
| Variance | 12207.82476 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 365 | 499 | 23.2% | |
| 364 | 202 | 9.4% | |
| 362 | 60 | 2.8% | |
| 363 | 52 | 2.4% | |
| 358 | 46 | 2.1% | |
| 180 | 32 | 1.5% | |
| 359 | 29 | 1.3% | |
| 179 | 29 | 1.3% | |
| 332 | 25 | 1.2% | |
| 90 | 23 | 1.1% | |
| Other values (276) | 1154 | 53.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 5 | 0.2% | |
| 2 | 4 | 0.2% | |
| 3 | 3 | 0.1% | |
| 4 | 4 | 0.2% | |
| 5 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 365 | 499 | 23.2% | |
| 364 | 202 | 9.4% | |
| 363 | 52 | 2.4% | |
| 362 | 60 | 2.8% | |
| 361 | 21 | 1.0% |
price_range
Categorical
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.9 KiB |
| Value | Count |
|---|---|
| 150 | 493 |
| 100 | 439 |
| 200 | 357 |
| 50 | 255 |
| 300 | 216 |
| Other values (14) | 390 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 150 | 493 | 22.9% | |
| 100 | 439 | 20.4% | |
| 200 | 357 | 16.6% | |
| 50 | 255 | 11.9% | |
| 300 | 216 | 10.0% | |
| 250 | 189 | 8.8% | |
| 350 | 67 | 3.1% | |
| 400 | 30 | 1.4% | |
| 450 | 18 | 0.8% | |
| 500 | 18 | 0.8% | |
| Other values (9) | 68 | 3.2% |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.880520688 |
| Min length | 3 |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
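The definition above (covariance divided by the product of the standard deviations) can be sketched in plain Python. This is an illustrative helper, not the code the report itself runs; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors in the covariance and in the standard deviations cancel,
    # so sums of products and sums of squares are sufficient.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting and positively rescaling y leaves r unchanged.
r_shifted = pearson_r(x, [3.0 * v + 7.0 for v in y])
print(round(r, 4), round(r_shifted, 4))
```

The explicit check for a constant input reflects that r is undefined when a standard deviation is zero, as would be the case for a constant column.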
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
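A minimal sketch of that recipe: convert each variable to ranks, then apply the Pearson formula to the ranks. The helper names are illustrative, and giving tied values their average rank is an assumption of this sketch (a common convention), not something the text above specifies.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance over the product of standard deviations (1/n factors cancel)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but not linear

print("pearson :", pearson_r(x, y))
print("spearman:", spearman_rho(x, y))
```

On this cubic relationship the ranks of x and y coincide, so ρ reaches its maximum, whereas Pearson's r stays below 1 because the relationship is not linear.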
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
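The pair-counting description above corresponds to the simplest form of the coefficient (often called tau-a). The sketch below implements exactly that form; the function name is illustrative, and it should not be read as the computation the report performs, since statistical libraries commonly use tie-adjusted variants such as tau-b.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

tau = kendall_tau_a(x, y)
print(tau)
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.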
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | df_index | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | price_range |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 18 | 355955 | Double room in an Authentic Peranakan Shophouse | 1759905 | Aresha | Central Region | Geylang | 1.31420 | 103.90232 | Private room | 81 | 90 | 0 | NaN | NaN | 1 | 173 | 100 |
| 1 | 23 | 481789 | Master Bedroom in Newly Built Flat | 2386154 | Susan | East Region | Tampines | 1.34816 | 103.93238 | Private room | 37 | 180 | 0 | NaN | NaN | 1 | 365 | 50 |
| 2 | 26 | 642660 | BEST CITY LIVING WITH GA RESIDENCE | 3212572 | Roger | Central Region | Rochor | 1.30109 | 103.85234 | Private room | 167 | 180 | 0 | NaN | NaN | 1 | 365 | 200 |
| 3 | 29 | 733863 | Homestay at Serangoon | 3824517 | Shirlnet | North-East Region | Serangoon | 1.36743 | 103.87288 | Private room | 26 | 180 | 0 | NaN | NaN | 1 | 365 | 50 |
| 4 | 36 | 768313 | Common Room for rent immediate | 4053150 | Immellymel | North-East Region | Punggol | 1.39963 | 103.90640 | Private room | 167 | 1 | 0 | NaN | NaN | 1 | 365 | 200 |
| 5 | 37 | 782227 | Cosy, bright, tasefully furnished | 4125828 | Jenny | Central Region | Queenstown | 1.28342 | 103.78585 | Private room | 128 | 3 | 0 | NaN | NaN | 1 | 365 | 150 |
| 6 | 43 | 823571 | Apartment away from town | 4177147 | Tania | East Region | Bedok | 1.31313 | 103.91479 | Private room | 278 | 1 | 0 | NaN | NaN | 1 | 365 | 300 |
| 7 | 46 | 880846 | 1 bedrm Aptm by Farrer Park Mrt | 4659563 | Gin | Central Region | Kallang | 1.31357 | 103.85731 | Private room | 100 | 365 | 0 | NaN | NaN | 1 | 365 | 100 |
| 8 | 82 | 1562453 | Deluxe Quad-sharing room | 8270362 | Domus | Central Region | Novena | 1.32359 | 103.84980 | Private room | 31 | 108 | 0 | NaN | NaN | 1 | 365 | 50 |
| 9 | 83 | 1581224 | Life Impact Coaching | 8408341 | Gerard | Central Region | Kallang | 1.31118 | 103.87367 | Entire home/apt | 131 | 14 | 0 | NaN | NaN | 1 | 365 | 150 |
Last rows
| | df_index | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | price_range |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2141 | 7897 | 38092051 | New Small Room @Orchard/Somerset/Central Area | 262337792 | Herman | Central Region | River Valley | 1.29482 | 103.83809 | Private room | 40 | 9 | 0 | NaN | NaN | 11 | 258 | 50 |
| 2142 | 7898 | 38092142 | Well connected 2 bedroom 2 bathroom apartment ! | 223622603 | Tanya | Central Region | Rochor | 1.30082 | 103.84956 | Entire home/apt | 200 | 2 | 0 | NaN | NaN | 4 | 347 | 200 |
| 2143 | 7899 | 38094671 | SMALL ROOM FOR ONE @SOMERSET/ORCHARD/CENTRAL AREA | 262337792 | Herman | Central Region | River Valley | 1.29369 | 103.83768 | Private room | 33 | 7 | 0 | NaN | NaN | 11 | 359 | 50 |
| 2144 | 7900 | 38102097 | 环境优雅的公寓 | 286260560 | Bo | West Region | Bukit Batok | 1.35654 | 103.76028 | Private room | 90 | 3 | 0 | NaN | NaN | 1 | 83 | 100 |
| 2145 | 7901 | 38104971 | 2 PAX LOFT Close To Kent Ridge Park | 278109833 | Belle | Central Region | Queenstown | 1.27973 | 103.78751 | Entire home/apt | 100 | 3 | 0 | NaN | NaN | 31 | 61 | 100 |
| 2146 | 7902 | 38105126 | Loft 2 pax near Haw Par / Pasir Panjang. Free Wifi | 278109833 | Belle | Central Region | Queenstown | 1.27973 | 103.78751 | Entire home/apt | 100 | 3 | 0 | NaN | NaN | 31 | 61 | 100 |
| 2147 | 7903 | 38108273 | 3bedroom luxury at Orchard | 238891646 | Neha | Central Region | Tanglin | 1.29269 | 103.82623 | Entire home/apt | 550 | 6 | 0 | NaN | NaN | 34 | 365 | 550 |
| 2148 | 7904 | 38109336 | [ Farrer Park ] New City Fringe CBD Mins to MRT | 281448565 | Mindy | Central Region | Kallang | 1.31286 | 103.85996 | Private room | 58 | 30 | 0 | NaN | NaN | 3 | 173 | 100 |
| 2149 | 7905 | 38110493 | Cheap Master Room in Central of Singapore | 243835202 | Huang | Central Region | River Valley | 1.29543 | 103.83801 | Private room | 56 | 14 | 0 | NaN | NaN | 2 | 30 | 100 |
| 2150 | 7906 | 38112762 | Amazing room with private bathroom walk to Orchard | 28788520 | Terence | Central Region | River Valley | 1.29672 | 103.83325 | Private room | 65 | 90 | 0 | NaN | NaN | 7 | 365 | 100 |